NSF Project IIS - 0535168 : Separating Speech from Speech Noise Annual Report 2007
نویسندگان
چکیده
This year we continued our multisite collaboration aimed at improved understanding and modeling of how listeners handle noisy or corrupt speech. On the perception side, we have continued experiments on how listeners perceive distorted and incomplete speech tokens, including the use of our spondee/foil paradigm, in which listeners are played tokens derived from two-syllable compound words which are either relatively familiar spondees (e.g. “bankroll”), or pairs of monosyllable words less likely to be encountered together, but with the same initial consonant-vowel, and final vowel-consonant (e.g. “band hole”). Our manipulations consist of replacing parts of the original speech with short acoustic excerpts that provide reduced information, for instance by replacing a 200 ms chunk with 200 ms of white noise. In the spondee/foil paradigm, we generally replace a section of the speech centered on the consonant cluster between the two syllables, which tends to mask the distinction between spondee and foil. Increasing the temporal width of the replacement, and/or reducing the amount of information from the original speech preserved in the replacement, will lead to
منابع مشابه
An overview on automatic speech attribute transcription (ASAT)
Automatic Speech Attribute Transcription (ASAT), an ITR project sponsored under the NSF grant (IIS-04-27113), is a cross-institute effort involving Georgia Institute of Technology, The Ohio State University, University of California at Berkeley, and Rutgers University. This project approaches speech recognition from a more linguistic perspective: unlike traditional ASR systems, humans detect ac...
متن کاملCS 229 = = Final Project Report SPEECH & NOISE SEPARATION
In this course project I investigated machine learning approaches on separating speech signals from background noise. Keywords—MFCC, SVM, noise separation, source separation, spectrogram
متن کاملNSF-CAREER: The Listening Machine IIS-0238301 2003–2008 Final Report
This six-year project started with the idea of applying sound recognition and separation techniques that had originated in speech recognition to a broader domain of environmental sound mixtures. As it proceeded, the work diversified into several distinct areas, reflecting the different directions of the graduate students primarily supported by the project: Manuel Reyes and Keansub Lee worked on...
متن کاملA New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
متن کاملSpeech Enhancement using Laplacian Mixture Model under Signal Presence Uncertainty
In this paper an estimator for speech enhancement based on Laplacian Mixture Model has been proposed. The proposed method, estimates the complex DFT coefficients of clean speech from noisy speech using the MMSE estimator, when the clean speech DFT coefficients are supposed mixture of Laplacians and the DFT coefficients of noise are assumed zero-mean Gaussian distribution. Furthermore, the MMS...
متن کامل